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Recent advances in health and life sciences have led to generation of a large amount 
of data. To facilitate access to its desired parts, such a big mass of data has been repre- 
sented in structured forms, like biomedical ontologies. On the other hand, representing 
ontologies in a formal language, constructing them independently from each other and 
storing them at different locations have brought about many challenges for answering 
queries about the knowledge represented in these ontologies. One of the challenges for 
the users is to be able represent a complex query in a natural language, and get its 
answers in an understandable form: Currently, such queries are answered by software 
systems in a formal language, however, the majority of the users lack the necessary 
knowledge of a formal query language to represent a query; moreover, none of these 
systems can provide informative explanations about the answers. Another challenge is 
to be able to answer complex queries that require appropriate integration of relevant 
knowledge stored in different places and in various forms. 

In this work, we address the first challenge by developing an intelligent user in- 
terface that allows users to enter biomedical queries in a natural language, and that 
presents the answers (possibly with explanations if requested) in a natural language. 
We address the second challenge by developing a rule layer over biomedical ontologies 
and databases, and use automated reasoners to answer queries considering relevant parts 
of the rule layer. The main contributions of our work can be summarized as follows: 

- We introduce a controlled natural language, a subset of natural language with a re- 
stricted grammar and vocabulary, specifically for biomedical queries towards drug 
discovery; we call this controlled natural language as BioQueryCNL |4|- For 
instance, in this language, we can pose the following query: 

"What are the genes that are targeted by the drug Epinephrine and that 
interact with the gene DLG4?" 

- We present an algorithm that converts a biomedical query in BioQueryCNL into 
a program in answer set programming (ASP) — a formal framework to automate 
reasoning about knowledge (H — making use of the parsing engine APE |5|. Fig- 
ure [T] shows the overall idea behind this algorithm. For instance, according to this 
algorithm, the query above is translated into the following ASP program: 

what_be_genes (GN1 ) :- 

gene_gene (GN1, "DLG4 " ) , 
drug_gene ( "Epinephrine" , GN1 ) . 

where gene_gene and drug_gene are defined in a "rule layer". 
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Fig. 1. Transforming a query in BioQueryCNL into an ASP program. 



- Once we transform the biomedical query into an ASP program and extract the rel- 
evant part of the rule layer (also an ASP program), we can compute its answers 
(if exists) using a state-of-the-art ASP system, such as CLASP |6|, DLV 113171 or 
DLVHEX [2|, as described in \ \\. Figure [2] shows the overall idea behind this algo- 
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Fig. 2. Extracting and integrating knowledge from ontologies or databases that is relevant to a 
given query, and finding an answer to the query using an ASP system. 



rithm. For instance, using CLASP, we compute the following answer to the query 
above: "ADRB1". 

- We construct an algorithm to provide minimal explanations to the answers. For 
instance, for "ADRB1" our algorithm provides the following minimal explanation: 



the drug "Epinephrine" targets the gene "ADRB1" according to CTD and 



the gene "ADRB1" interacts with the gene "DLG4" according to BioGrid 

The applicability of our methods is illustrated with some complex queries over 
Phar mGKB DrugBank, BioGrid SiDER|and |CTD| using the ASP systems 
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